Search Results
Stanford CS25: V1 I Transformer Circuits, Induction Heads, In-Context Learning
A Walkthrough of In-Context Learning and Induction Heads Part 1 of 2 (w/ Charles Frye)
Stanford CS25: V1 I Decision Transformer: Reinforcement Learning via Sequence Modeling
Catherine Olsson - Induction Heads
Stanford CS25: V1 I Self Attention and Non-parametric transformers (NPTs)
Understanding ICL: Induction Heads (Natural Language Processing at UT Austin)
SLT Summit 2023 - Induction Heads and Phase Transitions (Mech Interp 2)
EleutherAI Interpretability Reading Group 220423: In-context learning and induction heads
Stanford CS25: V1 I Transformers in Language: The development of GPT Models, GPT3
A Walkthrough of A Mathematical Framework for Transformer Circuits
Attention - General - Copying & Induction heads [rough early thoughts]
Stanford CS25: V1 I DeepMind's Perceiver and Perceiver IO: new data family architecture